Abstract

We prove new lower bounds on the likely size of an independent set in a random graph with a given 
average degree. Our method is a weighted version of the second moment method, where we give each 
independent set a weight based on the total degree of its vertices. 

1 Introduction 

We are interested in the likely size of the largest independent set $S$ in a random graph with a given average
degree. Azuma's inequality implies that $|S|$ is tightly concentrated around its expectation. Moreover, it is
easy to see that $|S| = \Theta(n)$ whenever the average degree is constant. Thus for each constant $c$, there is a
constant $\alpha_{\mathrm{crit}} = \alpha_{\mathrm{crit}}(c)$ such that

$$\lim_{n \to \infty} \Pr[G(n, p = c/n) \text{ has an independent set of size } \alpha n] = \begin{cases} 1 & \alpha < \alpha_{\mathrm{crit}} \\ 0 & \alpha > \alpha_{\mathrm{crit}} . \end{cases}$$

By standard arguments this holds in $G(n, m = cn/2)$ as well.

Our goal is to bound $\alpha_{\mathrm{crit}}$ as a function of $c$, or equivalently to bound

$$c_{\mathrm{crit}} = \sup\{c : \alpha_{\mathrm{crit}}(c) \ge \alpha\} ,$$

as a function of $\alpha$. For $c < e$, a greedy algorithm of Karp and Sipser [3] asymptotically finds a maximum
independent set, and analyzing this algorithm with differential equations yields the exact value of $\alpha_{\mathrm{crit}}$. For
larger $c$, Frieze [1] determined $\alpha_{\mathrm{crit}}$ to within $o(1/c)$, where $o$ refers to the limit where $c$ is large. These
bounds were improved by Coja-Oghlan and Efthymiou [2], who prove detailed results on the structure of the
set of independent sets.

We improve these bounds significantly. Our method is a weighted version of the second moment method,
inspired by the work of Achlioptas and Peres [4] on random $k$-SAT, where each independent set is given a
weight depending on the total degree of its vertices.

We work in a modified version of the $G(n, m)$ model which we call $\tilde{G}(n, m)$. For each of the $m$ edges, we
choose two vertices $u, v$ uniformly and independently and connect them. This may lead to a few multiple
edges or self-loops, and any vertex with a self-loop cannot belong to an independent set. In the sparse
case where $m = cn/2$ for constant $c$, with constant positive probability $\tilde{G}(n, m = cn/2)$ has no multiple
edges or self-loops, in which case it is uniform in the usual model $G(n, m = cn/2)$ where edges are chosen
without replacement from distinct pairs of vertices. Thus any property which holds with high probability
for $\tilde{G}(n, m)$ also holds with high probability for $G(n, m)$, and any bounds we prove on $\alpha_{\mathrm{crit}}$ in $\tilde{G}(n, m)$ also
hold in $G(n, m)$.

We review the first moment upper bound on $\alpha_{\mathrm{crit}}$ from Bollobás [5]. Let $X$ denote the number of
independent sets of size $\alpha n$ in $\tilde{G}(n, m)$. Then

$$\Pr[X > 0] \le E[X] .$$

By linearity of expectation, $E[X]$ is the sum over all $\binom{n}{\alpha n}$ sets of $\alpha n$ vertices of the probability that a given
one is independent. The $m$ edges $(u, v)$ are chosen independently and for each one $u, v \in S$ with probability
$\alpha^2$, so

$$E[X] = \binom{n}{\alpha n} (1 - \alpha^2)^m .$$

In the limit $n \to \infty$, Stirling's approximation $n! = (1 + o(1)) \sqrt{2 \pi n} \, n^n e^{-n}$ gives

$$\binom{n}{\alpha n} \sim \frac{1}{\sqrt{n}} \, e^{n h(\alpha)} ,$$

where $h$ is the entropy function

$$h(\alpha) = -\alpha \ln \alpha - (1 - \alpha) \ln(1 - \alpha) ,$$

and where $\sim$ hides constants that depend smoothly on $\alpha$. Thus

$$E[X] \sim \frac{1}{\sqrt{n}} \, e^{n (h(\alpha) + (c/2) \ln(1 - \alpha^2))} .$$

For each $c$, the $\alpha$ such that

$$h(\alpha) + (c/2) \ln(1 - \alpha^2) = 0 \tag{1}$$

is an upper bound on $\alpha_{\mathrm{crit}}(c)$, since for larger $\alpha$ the expectation $E[X]$ is exponentially small.
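As a numerical illustration of this first moment bound, the following Python sketch solves $h(\alpha) + (c/2)\ln(1 - \alpha^2) = 0$ by bisection. The function names and the choice of bisection are ours and are not part of the original argument; the sketch assumes $c > 0$ and relies on the exponent being positive for very small $\alpha$ and negative for $\alpha$ near 1.

```python
import math

def entropy(a):
    """Entropy function h(a) = -a ln a - (1 - a) ln(1 - a), in nats."""
    if a <= 0.0 or a >= 1.0:
        return 0.0
    return -a * math.log(a) - (1.0 - a) * math.log1p(-a)

def first_moment_exponent(a, c):
    """Growth rate of E[X] per vertex: h(a) + (c/2) ln(1 - a^2)."""
    return entropy(a) + 0.5 * c * math.log1p(-a * a)

def first_moment_upper_bound(c, tol=1e-12):
    """Bisect for the root of the exponent in (0, 1); assumes c > 0."""
    lo, hi = 1e-9, 1.0 - 1e-9  # exponent is positive at lo and negative at hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if first_moment_exponent(mid, c) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    for c in (5.0, 10.0, 100.0):
        print(c, first_moment_upper_bound(c))
```

The returned value is the density above which the expected number of independent sets is exponentially small, so it is only an upper bound on the critical density, not an estimate of it.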

We find it more convenient to parametrize our bounds in terms of the function $c_{\mathrm{crit}}(\alpha)$. Then (1) gives the
following upper bound,

$$c_{\mathrm{crit}}(\alpha) \le 2 \, \frac{\alpha \ln \alpha + (1 - \alpha) \ln(1 - \alpha)}{\ln(1 - \alpha^2)} \le 2 \, \frac{\ln(1/\alpha) + 1}{\alpha} . \tag{2}$$

We will prove the following nearly-matching lower bound.

Theorem 1. For any constant $x > 4/e$, for sufficiently small $\alpha$

$$c_{\mathrm{crit}}(\alpha) \ge 2 \, \frac{\ln(1/\alpha) + 1}{\alpha} - \frac{x}{\sqrt{\alpha}} . \tag{3}$$

We note that Coja-Oghlan and Efthymiou [2] bounded $c_{\mathrm{crit}}$ within a slightly larger factor $O(\sqrt{\ln(1/\alpha)/\alpha})$.

Inverting (2) and Theorem 1 gives the following bounds on $\alpha_{\mathrm{crit}}(c)$. The lower bound is a significant
improvement over previous results:

Corollary 2. Let $W(z)$ denote the largest positive root $w$ of the equation $w e^w = z$. Then for any constant
$y > 4\sqrt{2}/e$,

$$\frac{2}{c} W\!\left(\frac{ec}{2}\right) - y \, \frac{\sqrt{\ln c}}{c^{3/2}} \le \alpha_{\mathrm{crit}}(c) \le \frac{2}{c} W\!\left(\frac{ec}{2}\right) ,$$

where the lower bound holds for sufficiently large $c$.

If we like we can expand $W(ec/2)$ asymptotically in $c$,

$$W\!\left(\frac{ec}{2}\right) = \ln c - \ln\ln c + 1 - \ln 2 + \frac{\ln\ln c}{\ln c} - \frac{1 - \ln 2}{\ln c} + \frac{1}{2} \frac{(\ln\ln c)^2}{(\ln c)^2} - (2 - \ln 2) \frac{\ln\ln c}{(\ln c)^2} + \frac{3 + (\ln 2)^2 - 4 \ln 2}{2 (\ln c)^2} + O\!\left(\frac{(\ln\ln c)^3}{(\ln c)^3}\right) .$$

The first few of these terms correspond to the bound in [1], and we can extract as many additional terms as
we wish.
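To make the role of $W$ concrete, here is a small Python sketch that evaluates $W$ by Newton's method and compares it with the truncated expansion above. The helper names are hypothetical, and Newton's method is simply our choice of root finder; nothing here is taken from the original paper.

```python
import math

def lambert_w(z, iterations=100):
    """Positive root w of w * exp(w) = z for z > 0, via Newton's method."""
    w = math.log1p(z)  # starts at or above the root, so the iterates decrease
    for _ in range(iterations):
        ew = math.exp(w)
        w -= (w * ew - z) / (ew * (w + 1.0))
    return w

def w_expansion(c):
    """Truncated asymptotic expansion of W(ec/2), dropping the O(.) term."""
    L, LL, ln2 = math.log(c), math.log(math.log(c)), math.log(2.0)
    return (L - LL + 1.0 - ln2
            + LL / L - (1.0 - ln2) / L
            + 0.5 * LL ** 2 / L ** 2
            - (2.0 - ln2) * LL / L ** 2
            + (3.0 + ln2 ** 2 - 4.0 * ln2) / (2.0 * L ** 2))

def alpha_upper(c):
    """First moment upper bound (2/c) W(ec/2) on the critical density."""
    return 2.0 / c * lambert_w(math.e * c / 2.0)

if __name__ == "__main__":
    c = 1.0e6
    print(lambert_w(math.e * c / 2.0), w_expansion(c), alpha_upper(c))
```

The expansion converges slowly because its terms are powers of $\ln\ln c / \ln c$, so the comparison is only meaningful for fairly large $c$.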
2 The weighted second moment method



Our proof uses the second moment method. For any nonnegative random variable $X$, the Cauchy-Schwarz
inequality implies that

$$\Pr[X > 0] \ge \frac{E[X]^2}{E[X^2]} . \tag{4}$$

Unfortunately, applying this directly to the number $X$ of independent sets fails utterly. The problem is that
for most pairs of sets of size $\alpha n$, the events that they are independent are highly correlated, unlike the case
where the average degree grows sufficiently quickly with $n$ [1, 5]. As a result, $E[X^2]$ is exponentially larger
than $E[X]^2$, and the second moment method yields an exponentially small lower bound on $\Pr[X > 0]$.

One way to do away with these correlations, used by Frieze in [1], is to partition the vertices into sets
$V_i$ of $\lfloor 1/\alpha \rfloor$ vertices each and focus on those independent sets that intersect each $V_i$ exactly once. Here we
pursue a different approach, inspired by the work of Achlioptas and Peres [4] on random $k$-SAT. The idea
is to give each independent set $S$ a weight $w(S)$, depending exponentially on local quantities in the graph.
Specifically, we define

$$w(S) = \mu^{\#\text{ of edges } (u,v) \text{ with } u, v \notin S} ,$$

where $\mu < 1$. If the number of edges $m$ is fixed, the number of edges where neither endpoint is in $S$ is simply
$m$ minus the total degree of the vertices in $S$. Thus we can also write

$$w(S) = \mu^{m - \sum_{v \in S} \deg(v)} .$$

Weighting each $S$ in this way counteracts the temptation for $S$ to consist primarily of vertices of low degree.
This is analogous to [4], where satisfying assignments are given a weight that discourages them from satisfying
the majority of literals in the formula.
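The equivalence of the two expressions for the weight can be checked on a sampled graph. The Python sketch below draws edges with independent uniform endpoints, builds an independent set greedily, and counts the edges that avoid it in both ways. The greedy construction and all function names are illustrative assumptions of ours; the argument in the text does not use any particular algorithm for finding independent sets.

```python
import random

def sample_multigraph(n, m, rng):
    """Sample m edges; each endpoint is uniform and independent (loops and repeats allowed)."""
    return [(rng.randrange(n), rng.randrange(n)) for _ in range(m)]

def greedy_independent_set(n, edges):
    """Greedily pick vertices with no self-loop and no already chosen neighbour."""
    neighbours = [set() for _ in range(n)]
    for u, v in edges:
        neighbours[u].add(v)
        neighbours[v].add(u)
    chosen = set()
    for v in range(n):
        if v not in neighbours[v] and not (neighbours[v] & chosen):
            chosen.add(v)
    return chosen

def edges_avoiding(S, edges):
    """Number of edges with neither endpoint in S."""
    return sum(1 for u, v in edges if u not in S and v not in S)

def total_degree(S, edges):
    """Sum of the degrees of the vertices in S, counting multiple edges."""
    return sum((u in S) + (v in S) for u, v in edges)

def weight(S, edges, mu):
    """Weight mu ** (# of edges with both endpoints outside S), for independent S."""
    return mu ** edges_avoiding(S, edges)

if __name__ == "__main__":
    rng = random.Random(0)
    edges = sample_multigraph(200, 300, rng)
    S = greedy_independent_set(200, edges)
    print(len(S), edges_avoiding(S, edges), len(edges) - total_degree(S, edges))
```

The two counts agree only because $S$ is independent: every edge then has at most one endpoint in $S$, so each edge touching $S$ contributes exactly one to the total degree.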

We will apply the second moment method to the random variable

$$X = \sum_{\substack{S \subseteq V, \, |S| = \alpha n \\ S \text{ independent}}} w(S) .$$

If we tune $\mu$ properly, then for particular $\alpha^*, c^*$ we have $E[X^2] \sim E[X]^2$, in which case $\Pr[X > 0]$ is bounded
above zero. In that case $c_{\mathrm{crit}}(\alpha^*) \ge c^*$, or equivalently $\alpha_{\mathrm{crit}}(c^*) \ge \alpha^*$.

To this end, let us compute the first and second moments of our random variable $X$. We extend the
weight function $w(S)$ to all sets $S \subseteq V$ by setting $w(S) = 0$ if $S$ is not independent. That is,

$$X = \sum_{\substack{S \subseteq V \\ |S| = \alpha n}} w(S) ,$$

where

$$w(S) = \prod_{(u,v) \in E} w_{u,v}(S)$$

and

$$w_{u,v}(S) = \begin{cases} \mu & \text{if } u, v \notin S \\ 1 & \text{if } u \in S, v \notin S \text{ or vice versa} \\ 0 & \text{if } u, v \in S . \end{cases} \tag{5}$$

We start by computing $E[X]$. Fix a set $S$ of size $\alpha n$. Since the $m$ edges are chosen independently,

$$E[w(S)] = w_1(\alpha, \mu)^m \quad \text{where} \quad w_1(\alpha, \mu) = E_{u,v}[w_{u,v}(S)] .$$

For each edge $(u, v)$ in $\tilde{G}(n, m)$, $u$ and $v$ are chosen randomly and independently, so the probabilities of the
three cases in (5) are $(1 - \alpha)^2$, $2\alpha(1 - \alpha)$, and $\alpha^2$ respectively. Thus

$$w_1(\alpha, \mu) = (1 - \alpha)^2 \mu + 2\alpha(1 - \alpha) .$$

By linearity of expectation,

$$E[X] = \sum_{\substack{S \subseteq V \\ |S| = \alpha n}} E[w(S)] = \binom{n}{\alpha n} w_1(\alpha, \mu)^m .$$

Using Stirling's approximation, $\binom{n}{\alpha n} \sim \frac{1}{\sqrt{n}} e^{n h(\alpha)}$, and substituting $m = cn/2$ gives

$$E[X] \sim \frac{1}{\sqrt{n}} \, e^{n f_1(\alpha)} \quad \text{where} \quad f_1(\alpha) = h(\alpha) + \frac{c}{2} \ln w_1(\alpha, \mu) . \tag{6}$$

As before, $\sim$ hides constant factors that depend smoothly on $\alpha$.
Next we compute the second moment. We have

$$E[X^2] = E\!\left[ \sum_S w(S) \sum_T w(T) \right] = \sum_{S,T} E[w(S) \, w(T)] ,$$

where $S$ and $T$ are subsets of $V$ of size $\alpha n$. The expectation of $w(S) \, w(T)$ does not depend on the specific
choice of $S$ and $T$, but it does depend on the size of their intersection. We say that $S$ and $T$ have overlap $\zeta$
if $|S \cap T| = \zeta n$. Again using the independence of the edges, we have

$$E[w(S) \, w(T)] = w_2(\alpha, \zeta, \mu)^m \quad \text{where} \quad w_2(\alpha, \zeta, \mu) = E_{u,v}[w_{u,v}(S) \, w_{u,v}(T)] .$$

For each edge $(u, v)$ of $G$, the probability that it has no endpoints in $S$ or $T$ is $(1 - 2\alpha + \zeta)^2$, in which
case it contributes $\mu^2$ to $w_{u,v}(S) \, w_{u,v}(T)$. The probability that it has one endpoint in $S$ and none in $T$ or
vice versa is $2(2\alpha - 2\zeta)(1 - 2\alpha + \zeta)$, in which case it contributes $\mu$. Finally, the probability that it has one
endpoint in $S$ and one in $T$ is $2(\alpha - \zeta)^2 + 2\zeta(1 - 2\alpha + \zeta)$, in which case it contributes 1. With the remaining
probability it has both endpoints in $S$ or both in $T$, making at least one of them non-independent and
contributing zero. Thus

$$w_2(\alpha, \zeta, \mu) = (1 - 2\alpha + \zeta)^2 \mu^2 + 4(\alpha - \zeta)(1 - 2\alpha + \zeta) \mu + 2(\alpha - \zeta)^2 + 2\zeta(1 - 2\alpha + \zeta) .$$

Observe that when $\zeta = \alpha^2$, as it typically would be if $S$ and $T$ were chosen independently and uniformly, we
have

$$w_2 = w_1^2 . \tag{7}$$
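The case analysis for a single edge can be sanity-checked numerically. The Python sketch below lists the four edge types with their probabilities, confirms that the probabilities sum to one, and confirms the identity between the pair weight at overlap $\alpha^2$ and the square of the single-set weight. The function names and the sample values of $\alpha$ and $\mu$ are assumptions chosen for illustration.

```python
def w1(alpha, mu):
    """Expected weight of one random edge with respect to a single set."""
    return (1 - alpha) ** 2 * mu + 2 * alpha * (1 - alpha)

def edge_cases(alpha, zeta):
    """Edge-type probabilities for sets S, T of density alpha and overlap zeta.

    Each probability is paired with the power of mu the edge contributes to
    the product of the two weights; None marks the weight-zero case.
    """
    outside = 1 - 2 * alpha + zeta   # fraction of vertices in neither S nor T
    only_one = alpha - zeta          # fraction in S but not T (same for T but not S)
    return [
        (outside ** 2, 2),
        (4 * only_one * outside, 1),
        (2 * only_one ** 2 + 2 * zeta * outside, 0),
        (2 * alpha ** 2 - zeta ** 2, None),
    ]

def w2(alpha, zeta, mu):
    """Expected product of the two edge weights for one random edge."""
    return sum(p * mu ** k for p, k in edge_cases(alpha, zeta) if k is not None)

if __name__ == "__main__":
    for alpha, mu in [(0.1, 0.3), (0.25, 0.9), (0.4, 0.05)]:
        print(alpha, mu, w2(alpha, alpha ** 2, mu), w1(alpha, mu) ** 2)
```

The last entry of `edge_cases` is the leftover probability $2\alpha^2 - \zeta^2$ that both endpoints fall in $S$ or both fall in $T$, obtained by inclusion-exclusion.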
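The number of pairs of sets $S, T$ of size $\alpha n$ and intersection of size $z = \zeta n$ is the multinomial

$$\binom{n}{\zeta n, \, (\alpha - \zeta) n, \, (\alpha - \zeta) n, \, (1 - 2\alpha + \zeta) n} = \binom{n}{\alpha n} \binom{\alpha n}{\zeta n} \binom{(1 - \alpha) n}{(\alpha - \zeta) n} ,$$

and linearity of expectation gives

$$E[X^2] = \sum_{z=0}^{\alpha n} \binom{n}{z, \, \alpha n - z, \, \alpha n - z, \, (1 - 2\alpha) n + z} w_2(\alpha, z/n, \mu)^m .$$

This sum is dominated by the terms where $\zeta = z/n$ is bounded inside the interval $(0, \alpha)$. Stirling's approximation
then gives

$$\binom{n}{\zeta n, \, (\alpha - \zeta) n, \, (\alpha - \zeta) n, \, (1 - 2\alpha + \zeta) n} \sim \frac{1}{n^{3/2}} \, e^{n \left[ h(\alpha) + \alpha h(\zeta/\alpha) + (1 - \alpha) h\left(\frac{\alpha - \zeta}{1 - \alpha}\right) \right]} , \tag{8}$$

where $\sim$ hides constants that vary slowly with $\alpha$ and $\zeta$. Thus the contribution to $E[X^2]$ of pairs of sets with
overlap $\zeta \in (0, \alpha)$ is

$$\frac{1}{n^{3/2}} \, e^{n f_2(\alpha, \zeta, \mu)} \quad \text{where} \quad f_2(\alpha, \zeta, \mu) = h(\alpha) + \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) + \frac{c}{2} \ln w_2(\alpha, \zeta, \mu) . \tag{9}$$

Combining this with (6), we can write

$$\frac{E[X^2]}{E[X]^2} \sim \frac{1}{\sqrt{n}} \sum_{z=0}^{\alpha n} e^{n \phi(z/n)} , \tag{10}$$

where

$$\phi(\zeta) = f_2(\alpha, \zeta, \mu) - 2 f_1(\alpha, \mu) = \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) - h(\alpha) + \frac{c}{2} \ln \frac{w_2(\alpha, \zeta, \mu)}{w_1(\alpha, \mu)^2} .$$

Using (7) and the fact that the entropy terms cancel, we have

$$\phi(\alpha^2) = 0 .$$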

In other words, the contribution to $E[X^2]$ from pairs of sets with overlap $\alpha^2$ is proportional to $E[X]^2$.
We can now replace the sum in (10) with an integral,

$$\frac{E[X^2]}{E[X]^2} \sim \frac{1}{\sqrt{n}} \sum_{z=0}^{\alpha n} e^{n \phi(z/n)} \sim \sqrt{n} \int_0^\alpha e^{n \phi(\zeta)} \, d\zeta ,$$

and evaluate this integral using Laplace's method as in [6, Lemma 3]. Its asymptotic behavior depends on
the maximum value of $\phi$,

$$\phi_{\max} = \max_{\zeta \in [0, \alpha]} \phi(\zeta) .$$

If $\phi'' < 0$ at the corresponding $\zeta_{\max}$, then it is dominated by an interval of width $\Theta(1/\sqrt{n})$ around $\zeta_{\max}$ and

$$\frac{E[X^2]}{E[X]^2} \sim e^{n \phi_{\max}} .$$

If $\phi_{\max} = \phi(\alpha^2) = 0$, then $E[X^2] \sim E[X]^2$ and the second moment method succeeds. Thus our goal is to
show that $\phi$ is maximized at $\alpha^2$.

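For this to happen, we at least need $\zeta = \alpha^2$ to be a local maximum of $\phi$. In particular, we need

$$\phi'(\alpha^2) = 0 . \tag{11}$$

Differentiating, we find that (11) holds if

$$\mu = \frac{1 - 2\alpha}{1 - \alpha} .$$

Henceforth, we will fix $\mu$ to this value. In that case we have

$$w_1 = 1 - \alpha \quad \text{and} \quad w_2 = (1 - \alpha)^2 + \frac{(\zeta - \alpha^2)^2}{(1 - \alpha)^2} ,$$

so

$$\phi(\zeta) = \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) - h(\alpha) + \frac{c}{2} \ln\!\left(1 + \frac{(\zeta - \alpha^2)^2}{(1 - \alpha)^4}\right) .$$

The remainder of this paper is dedicated to showing that for sufficiently small $\alpha$ as a function of $c$ or vice
versa, $\phi$ is indeed maximized at $\alpha^2$.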
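To see what this condition means in practice, the following Python sketch evaluates the exponent with the weight parameter tuned as above and scans it over a grid of overlaps. The grid scan, the function names, and the sample values of the density (0.01) and the average degree (500 and 2000) are illustrative assumptions, not part of the proof.

```python
import math

def h(x):
    """Entropy function in nats, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log(x) - (1.0 - x) * math.log1p(-x)

def phi(zeta, alpha, c):
    """Exponent phi(zeta) with mu fixed to (1 - 2 alpha) / (1 - alpha)."""
    entropy_part = (alpha * h(zeta / alpha)
                    + (1.0 - alpha) * h((alpha - zeta) / (1.0 - alpha))
                    - h(alpha))
    return entropy_part + 0.5 * c * math.log1p(
        (zeta - alpha ** 2) ** 2 / (1.0 - alpha) ** 4)

def phi_max_on_grid(alpha, c, points=20001):
    """Largest value of phi on an evenly spaced grid over [0, alpha]."""
    return max(phi(alpha * i / (points - 1), alpha, c) for i in range(points))

if __name__ == "__main__":
    alpha = 0.01
    print(phi(alpha ** 2, alpha, 500.0))   # vanishes up to rounding
    print(phi_max_on_grid(alpha, 500.0))   # no grid point exceeds zero
    print(phi(alpha, alpha, 2000.0))       # positive: the method fails here
```

At the smaller degree the scan finds no overlap that beats $\alpha^2$, while at the larger degree, which lies above the first moment bound for this density, the exponent is positive at full overlap and the ratio of moments grows exponentially.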
3 Finding and bounding the maxima

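Using $\ln(1 + x) \le x$, we write $\phi(\zeta) \le \psi(\zeta)$ where

$$\psi(\zeta) = \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) - h(\alpha) + \frac{c}{2} \, \frac{(\zeta - \alpha^2)^2}{(1 - \alpha)^4} . \tag{12}$$

Note that

$$\psi(\alpha^2) = \phi(\alpha^2) = 0 .$$

Our goal is to show for an appropriate $c$ that $\zeta = \alpha^2$ is in fact the global maximum of $\psi$, and therefore of
$\phi$. In what follows, asymptotic symbols such as $O$ and $o$ refer to the limit $\alpha \to 0$, or equivalently the limit
$c \to \infty$. Error terms may be positive or negative unless otherwise stated.

The first two derivatives of $\psi(\zeta)$ are

$$\psi'(\zeta) = \frac{c (\zeta - \alpha^2)}{(1 - \alpha)^4} + 2 \ln(\alpha - \zeta) - \ln \zeta - \ln(1 - 2\alpha + \zeta) \tag{13}$$

$$\psi''(\zeta) = \frac{c}{(1 - \alpha)^4} - \frac{2}{\alpha - \zeta} - \frac{1}{\zeta} - \frac{1}{1 - 2\alpha + \zeta} . \tag{14}$$

The second derivative $\psi''(\zeta)$ tends to $-\infty$ at $\zeta = 0$ and $\zeta = \alpha$. Setting $\psi''(\zeta) = 0$ yields a cubic equation in
$\zeta$ which has one negative root and, for sufficiently small $\alpha$, two positive roots in the interval $[0, \alpha]$. Thus for
the relevant $c$ and sufficiently small $\alpha$, there are $0 < \zeta_1 < \zeta_2 < \alpha$ where

$$\psi''(\zeta) \begin{cases} < 0 & 0 < \zeta < \zeta_1 \\ > 0 & \zeta_1 < \zeta < \zeta_2 \\ < 0 & \zeta_2 < \zeta < \alpha . \end{cases}$$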

It follows that $\psi$ can have at most two local maxima. One is in the interval $[0, \zeta_1]$, and the following
lemma shows that for the relevant $\alpha$ and $c$ this is $\alpha^2$:

Lemma 3. If $c = o(1/\alpha^2)$ then for sufficiently small $\alpha$, $\zeta_1 > \alpha^2$ and $\psi(\alpha^2)$ is a local maximum.
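The two inflection points can be located numerically for concrete parameters. The Python sketch below brackets them around the point that minimizes $2/(\alpha - \zeta) + 1/\zeta$ and bisects on the sign of the second derivative. Bisection, the function names, and the sample values (density 0.01, average degree 1000) are our own illustrative choices; the sketch only applies when the degree is large enough compared with $1/\alpha$ for the second derivative to be positive at the bracketing point.

```python
import math

def psi2(zeta, alpha, c):
    """Second derivative of psi with respect to zeta."""
    return (c / (1.0 - alpha) ** 4 - 2.0 / (alpha - zeta)
            - 1.0 / zeta - 1.0 / (1.0 - 2.0 * alpha + zeta))

def bisect_root(f, lo, hi, steps=200):
    """Bisection for a sign change of f between lo and hi."""
    f_lo_positive = f(lo) > 0.0
    for _ in range(steps):
        mid = 0.5 * (lo + hi)
        if (f(mid) > 0.0) == f_lo_positive:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def inflection_points(alpha, c):
    """Return (zeta_1, zeta_2), assuming psi'' > 0 at zeta = alpha / (1 + sqrt 2)."""
    trial = alpha / (1.0 + math.sqrt(2.0))  # minimises 2/(alpha - zeta) + 1/zeta
    if psi2(trial, alpha, c) <= 0.0:
        raise ValueError("psi'' is not positive at the trial point; c is too small")
    f = lambda z: psi2(z, alpha, c)
    eps = alpha * 1e-12
    return bisect_root(f, eps, trial), bisect_root(f, trial, alpha - eps)

if __name__ == "__main__":
    alpha, c = 0.01, 1000.0
    z1, z2 = inflection_points(alpha, c)
    print(z1, z2, alpha ** 2)
```

For these sample values the first inflection point lies well above $\alpha^2$, in line with Lemma 3.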

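The other local maximum is in the interval $[\zeta_2, \alpha]$, and we denote it $\zeta_3$. To locate it, first we bound $\zeta_2$:

Lemma 4. If

$$c = (2 + o(1)) \, \frac{\ln(1/\alpha)}{\alpha} ,$$

then

$$\frac{\zeta_2}{\alpha} = 1 - \delta_2 \quad \text{where} \quad \delta_2 = \frac{1 + o(1)}{\ln(1/\alpha)} .$$

Thus $\zeta_2/\alpha$, and therefore $\zeta_3/\alpha$, tends toward 1 as $\alpha \to 0$.
We can now locate $\zeta_3$ when $\alpha$ is close to its critical value.

Lemma 5. If

$$c = \frac{1}{\alpha} \left( 2 \ln(1/\alpha) + 2 - o(1) \right) ,$$

then

$$\frac{\zeta_3}{\alpha} = 1 - \delta_3 \quad \text{where} \quad \delta_3 = \frac{1 + o(1)}{e} \sqrt{\alpha} .$$

Lemma 6. For any constant $x > 4/e$, if

$$c = \frac{2 \ln(1/\alpha) + 2 - x \sqrt{\alpha}}{\alpha} ,$$

then $\psi(\zeta_3) < 0$ for sufficiently small $\alpha$.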
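Lemma 6 can be probed numerically at a small but finite density. The Python sketch below evaluates $\psi$ at overlaps $\alpha(1 - b\sqrt{\alpha}/e)$ over a grid of $b$, which is the parametrization used in the proofs, and reports the largest value found. The grid of $b$ values, the density $10^{-4}$, and the two trial values of $x$ are assumptions for illustration; a grid scan is not a proof and only approximates the value at the true local maximum.

```python
import math

def h(x):
    """Entropy function in nats, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log(x) - (1.0 - x) * math.log1p(-x)

def psi(zeta, alpha, c):
    """Upper bound psi(zeta) on the exponent phi(zeta)."""
    entropy_part = (alpha * h(zeta / alpha)
                    + (1.0 - alpha) * h((alpha - zeta) / (1.0 - alpha))
                    - h(alpha))
    return entropy_part + 0.5 * c * (zeta - alpha ** 2) ** 2 / (1.0 - alpha) ** 4

def psi_near_second_maximum(alpha, x, b_values=None):
    """Largest psi(alpha (1 - b sqrt(alpha)/e)) over a grid of b, with c as in Lemma 6."""
    c = (2.0 * math.log(1.0 / alpha) + 2.0 - x * math.sqrt(alpha)) / alpha
    if b_values is None:
        b_values = [0.2 + 0.01 * i for i in range(481)]  # b from 0.2 to 5.0
    best = -math.inf
    for b in b_values:
        delta = b * math.sqrt(alpha) / math.e
        best = max(best, psi(alpha * (1.0 - delta), alpha, c))
    return best

if __name__ == "__main__":
    print(psi_near_second_maximum(1e-4, 4.0))  # negative for this choice of x
    print(psi_near_second_maximum(1e-4, 0.0))  # positive when the correction is dropped
```

Values of $x$ only slightly above $4/e$ need a much smaller density before the leading term dominates the lower-order corrections, which is why the lemma is stated for sufficiently small $\alpha$.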
4 Proofs

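Proof of Lemma 3. Setting $\zeta = \alpha^2$ in (14) gives

$$\psi''(\alpha^2) = \frac{c}{(1 - \alpha)^4} - \frac{1}{\alpha^2 (1 - \alpha)^2} .$$

If $c = o(1/\alpha^2)$ this is negative for sufficiently small $\alpha$, in which case $\zeta_1 > \alpha^2$ and $\psi(\alpha^2)$ is a local maximum.
□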

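Proof of Lemma 4. For any constant $b$, if

$$\frac{\zeta}{\alpha} = 1 - \delta \quad \text{where} \quad \delta = \frac{b}{\ln(1/\alpha)} \tag{15}$$

then (14) gives

$$\psi''(\zeta) = \left( 2 - \frac{2}{b} + o(1) \right) \frac{\ln(1/\alpha)}{\alpha} - O(1/\alpha) .$$

If $b \ne 1$, for sufficiently small $\alpha$ this is negative if $b < 1$ and positive if $b > 1$. Therefore $\zeta_2/\alpha = 1 - \delta_2$ where
$\delta_2 = (1 + o(1))/\ln(1/\alpha)$. □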

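Proof of Lemma 5. Lemma 4 tells us that $\zeta_3 = \alpha(1 - \delta)$ for some

$$\delta \le \frac{1 + o(1)}{\ln(1/\alpha)} .$$

Setting $\zeta = \alpha(1 - \delta)$ in (13) and using

$$\frac{1}{(1 - \alpha)^4} = 1 + O(\alpha) \quad \text{and} \quad -\ln(1 - x) = O(x)$$

gives, after some algebra,

$$\psi'(\zeta) = \alpha c + \ln \alpha + 2 \ln \delta + O(\alpha \delta c) + O(\alpha^2 c) .$$

For any constant $b$, setting

$$\delta = \frac{b \sqrt{\alpha}}{e}$$

gives

$$\psi'(\zeta) = \alpha c + 2 \ln \alpha + 2 \ln b - 2 + O(\alpha^{3/2} c) ,$$

and setting

$$c = \frac{2 \ln(1/\alpha) + 2 - \varepsilon}{\alpha}$$

then gives

$$\psi'(\zeta) = 2 \ln b - \varepsilon + o(1) .$$

If $\varepsilon = o(1)$ and $b \ne 1$, for sufficiently small $\alpha$ this is negative if $b < 1$ and positive if $b > 1$. Therefore
$\zeta_3/\alpha = 1 - \delta_3$ where $\delta_3 = (1 + o(1)) \sqrt{\alpha}/e$. □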

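Proof of Lemma 6. Setting $\zeta = \alpha(1 - \delta)$ where $\delta = b \sqrt{\alpha}/e$ in (12) and using the Taylor series

$$\frac{1}{(1 - \alpha)^4} = 1 + 4\alpha + O(\alpha^2) \quad \text{and} \quad -\ln(1 - x) = x + x^2/2 + O(x^3)$$

gives, after a fair amount of algebra,

$$\psi(\zeta) = \alpha (\ln \alpha - 1) - \left( \frac{2b \ln \alpha - 4b + 2b \ln b}{e} \right) \alpha^{3/2} + \left( \frac{c + 1}{2} - \frac{b^2}{2e^2} \right) \alpha^2 - \frac{b}{e} \, \alpha^{5/2} c + \left( \frac{b^2}{2e^2} + 1 \right) \alpha^3 c + O(\alpha^{7/2} c) + O(\alpha^{5/2}) .$$

Setting

$$c = \frac{2 \ln(1/\alpha) + 2 - x \sqrt{\alpha}}{\alpha}$$

for constant $x$ causes the terms proportional to $\alpha \ln \alpha$, $\alpha$, and $\alpha^{3/2} \ln \alpha$ to cancel, leaving

$$\psi(\zeta) = \left( \frac{2b (1 - \ln b)}{e} - \frac{x}{2} \right) \alpha^{3/2} + O(\alpha^2 \ln(1/\alpha)) .$$

The coefficient of $\alpha^{3/2}$ is maximized when $b = 1$, and is negative whenever $x > 4/e$. In that case, $\psi(\zeta_3) < 0$
for sufficiently small $\alpha$, completing the proof. □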

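Proof of Corollary 2. First note that

$$\alpha_0 = \frac{2}{c} W\!\left(\frac{ec}{2}\right) \tag{16}$$

is the root of the equation

$$c = 2 \, \frac{\ln(1/\alpha_0) + 1}{\alpha_0} ,$$

since we can also write it as

$$e^{c \alpha_0 / 2} = \frac{e}{\alpha_0} ,$$

and multiplying both sides by $c \alpha_0 / 2$ gives

$$\frac{c \alpha_0}{2} \, e^{c \alpha_0 / 2} = \frac{ec}{2} ,$$

in which case (16) follows from the definition of $W$.
The root $\alpha$ of

$$c = 2 \, \frac{\ln(1/\alpha) + 1}{\alpha} - \frac{x}{\sqrt{\alpha}}$$

is then at least

$$\frac{2}{c} W\!\left(\frac{ec}{2}\right) + (x + o(1)) \, \frac{\partial \alpha_0}{\partial c} \sqrt{\frac{c}{2 \ln c}} ,$$

since $\alpha = (1 + o(1)) \, 2 \ln c / c$ and $\partial^2 \alpha_0 / \partial c^2 \ge 0$. Since

$$\frac{\partial \alpha_0}{\partial c} = -(1 + o(1)) \, \frac{2 \ln c}{c^2} ,$$

the statement follows from Theorem 1. □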
Abstract. We prove new lower bounds on the likely size of the maximum
independent set in a random graph with a given constant average
degree. Our method is a weighted version of the second moment method,
where we give each independent set a weight based on the total degree
of its vertices.

1 Introduction 

We are interested in the likely size of the largest independent set $S$ in a random
graph with a given average degree. It is easy to see that $|S| = \Theta(n)$ whenever the
average degree is constant, and Shamir and Spencer [8] showed using Azuma's
inequality that, for any fixed $n$, $|S|$ is tightly concentrated around its mean.
Moreover, Bayati, Gamarnik, and Tetali [3] recently showed that $|S|/n$ converges
to a limit with high probability. Thus for each constant $c$ there is a constant
$\alpha_{\mathrm{crit}} = \alpha_{\mathrm{crit}}(c)$ such that

$$\lim_{n \to \infty} \Pr[G(n, p = c/n) \text{ has an independent set of size } \alpha n] = \begin{cases} 1 & \alpha < \alpha_{\mathrm{crit}} \\ 0 & \alpha > \alpha_{\mathrm{crit}} . \end{cases}$$

By standard arguments this holds in $G(n, m = cn/2)$ as well.

Our goal is to bound $\alpha_{\mathrm{crit}}$ as a function of $c$, or equivalently to bound

$$c_{\mathrm{crit}} = \sup\{c : \alpha_{\mathrm{crit}}(c) \ge \alpha\} ,$$

as a function of $\alpha$. For $c < e$, a greedy algorithm of Karp and Sipser [7]
asymptotically finds a maximum independent set, and analyzing this algorithm with
differential equations yields the exact value of $\alpha_{\mathrm{crit}}$. For larger $c$, Frieze [6]
determined $\alpha_{\mathrm{crit}}$ to within $o(1/c)$, where $o$ refers to the limit where $c$ is large. These
bounds were improved by Coja-Oghlan and Efthymiou [5], who prove detailed
results on the structure of the set of independent sets.

We further improve these bounds. Our method is a weighted version of the
second moment method, inspired by the work of Achlioptas and Peres [2] on
random $k$-SAT, where each independent set is given a weight depending on the
total degree of its vertices. In addition to improving bounds on this particular
problem, our hope is that this advances the art and science of inventing random
variables that counteract local sources of correlation in random structures.



We work in a modified version of the $G(n, m)$ model which we call $\tilde{G}(n, m)$.
For each of the $m$ edges, we choose two vertices $u, v$ uniformly and independently
and connect them. This may lead to a few multiple edges or self-loops. A vertex
with a self-loop cannot belong to an independent set. In the sparse case where
$m = cn/2$ for constant $c$, with constant positive probability $\tilde{G}(n, m = cn/2)$ has
no multiple edges or self-loops, in which case it is uniform in the usual model
$G(n, m = cn/2)$ where edges are chosen without replacement from distinct pairs
of vertices. Thus any property which holds with high probability for $\tilde{G}(n, m)$
also holds with high probability for $G(n, m)$, and any bounds we prove on $\alpha_{\mathrm{crit}}$
in $\tilde{G}(n, m)$ also hold in $G(n, m)$.

We review the first moment upper bound on $\alpha_{\mathrm{crit}}$ from Bollobás [4]. Let $X$
denote the number of independent sets of size $\alpha n$ in $\tilde{G}(n, m)$. Then

$$\Pr[X > 0] \le E[X] .$$

By linearity of expectation, $E[X]$ is the sum over all $\binom{n}{\alpha n}$ sets of $\alpha n$ vertices of
the probability that a given one is independent. The $m$ edges $(u, v)$ are chosen
independently and for each one $u, v \in S$ with probability $\alpha^2$, so

$$E[X] = \binom{n}{\alpha n} (1 - \alpha^2)^m .$$
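In the limit $n \to \infty$, Stirling's approximation $n! = (1 + o(1)) \sqrt{2 \pi n} \, n^n e^{-n}$ gives

$$\binom{n}{\alpha n} = \Theta\!\left( \frac{1}{\sqrt{n}} \, e^{n h(\alpha)} \right) , \tag{1}$$

where $h$ is the entropy function

$$h(\alpha) = -\alpha \ln \alpha - (1 - \alpha) \ln(1 - \alpha) ,$$

and where $\Theta$ hides constants that depend smoothly on $\alpha$. Thus

$$E[X] = \Theta\!\left( \frac{1}{\sqrt{n}} \, e^{n (h(\alpha) + (c/2) \ln(1 - \alpha^2))} \right) .$$

For each $c$, the $\alpha$ such that

$$h(\alpha) + (c/2) \ln(1 - \alpha^2) = 0 \tag{2}$$

is an upper bound on $\alpha_{\mathrm{crit}}(c)$, since for larger $\alpha$ the expectation $E[X]$ is
exponentially small.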

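We find it more convenient to parametrize our bounds in terms of the function
$c_{\mathrm{crit}}(\alpha)$. Then (2) gives the following upper bound,

$$c_{\mathrm{crit}}(\alpha) \le 2 \, \frac{\alpha \ln \alpha + (1 - \alpha) \ln(1 - \alpha)}{\ln(1 - \alpha^2)} \le 2 \, \frac{\ln(1/\alpha) + 1}{\alpha} . \tag{3}$$

We will prove the following nearly-matching lower bound.

Theorem 1.1. For any constant $x > 4/e$, for sufficiently small $\alpha$

$$c_{\mathrm{crit}}(\alpha) \ge 2 \, \frac{\ln(1/\alpha) + 1}{\alpha} - \frac{x}{\sqrt{\alpha}} . \tag{4}$$

Coja-Oghlan and Efthymiou [5] bounded $c_{\mathrm{crit}}$ within a slightly larger factor
$O(\sqrt{\ln(1/\alpha)/\alpha})$.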

Inverting (3) and Theorem 1.1 gives the following bounds on $\alpha_{\mathrm{crit}}(c)$. The
lower bound is a significant improvement over previous results:

Corollary 1.2. For $z > 0$, let $W(z)$ denote the unique positive root $x$ of the
equation $x e^x = z$. Then for any constant $y > 4\sqrt{2}/e$,

$$\frac{2}{c} W\!\left(\frac{ec}{2}\right) - y \, \frac{\sqrt{\ln c}}{c^{3/2}} \le \alpha_{\mathrm{crit}}(c) \le \frac{2}{c} W\!\left(\frac{ec}{2}\right) ,$$

where the lower bound holds for sufficiently large $c$.

If we like we can expand $W(ec/2)$ asymptotically in $c$,

$$W\!\left(\frac{ec}{2}\right) = \ln c - \ln\ln c + 1 - \ln 2 + \frac{\ln\ln c}{\ln c} - \frac{1 - \ln 2}{\ln c} + \frac{1}{2} \frac{(\ln\ln c)^2}{(\ln c)^2} - (2 - \ln 2) \frac{\ln\ln c}{(\ln c)^2} + \frac{3 + (\ln 2)^2 - 4 \ln 2}{2 (\ln c)^2} + O\!\left(\frac{(\ln\ln c)^3}{(\ln c)^3}\right) .$$

The first few of these terms correspond to the bound in [6], and we can extract
as many additional terms as we wish.



2 The weighted second moment method 

Our proof uses the second moment method. For any nonnegative random variable
$X$, the Cauchy-Schwarz inequality implies that

$$\Pr[X > 0] \ge \frac{E[X]^2}{E[X^2]} . \tag{5}$$

If X counts the number of objects of a certain kind, (5) shows that at least one 
such object exists as long as the expected number of objects is large and the 
variance is not too large. 
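For a very small instance the inequality can be verified exactly by enumerating every outcome of the random multigraph. The Python sketch below does this with exact rational arithmetic; the function names and the toy sizes (4 vertices, 2 edges, sets of size 2) are illustrative assumptions, and brute-force enumeration is only feasible for such tiny cases.

```python
from fractions import Fraction
from itertools import combinations, product

def count_independent_sets(n, k, edges):
    """Number of k-subsets of range(n) containing no edge (self-loops count as edges)."""
    count = 0
    for subset in combinations(range(n), k):
        members = set(subset)
        if not any(u in members and v in members for u, v in edges):
            count += 1
    return count

def exact_moments(n, m, k):
    """Exact Pr[X > 0], E[X], E[X^2] when each of m edges has two independent uniform endpoints."""
    pairs = list(product(range(n), repeat=2))  # ordered endpoint pairs, loops included
    total = Fraction(len(pairs) ** m)
    positive = first = second = 0
    for edges in product(pairs, repeat=m):
        x = count_independent_sets(n, k, edges)
        positive += x > 0
        first += x
        second += x * x
    return positive / total, first / total, second / total

if __name__ == "__main__":
    p, e1, e2 = exact_moments(4, 2, 2)
    print(p, e1, e2, e1 * e1 / e2)
```

The first moment returned here also matches the closed form $\binom{n}{\alpha n}(1 - \alpha^2)^m$ from the introduction with $\alpha = 1/2$.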

Unfortunately, applying this directly to the number $X$ of independent sets
fails utterly. The problem is that for most pairs of sets of size $\alpha n$, the events that
they are independent are highly correlated, unlike the case where the average
degree grows sufficiently quickly with $n$ [4, 6]. As a result, $E[X^2]$ is exponentially
larger than $E[X]^2$, and the second moment method yields an exponentially small
lower bound on $\Pr[X > 0]$.

One way to deal with these correlations, used by Frieze in [6], is to partition
the vertices into sets $V_i$ of $\lfloor 1/\alpha \rfloor$ vertices each and focus on those independent
sets that intersect each $V_i$ exactly once. In that case, a large-deviations inequality
allows us to override the correlations.

Here we pursue a different approach, inspired by the work of Achlioptas and
Peres [2] on random $k$-SAT. The idea is to give each independent set $S$ a weight
$w(S)$, depending exponentially on local quantities in the graph. Specifically, we
define

$$w(S) = \mu^{\#\text{ of edges } (u,v) \text{ with } u, v \notin S}$$

for some $\mu < 1$. If the number of edges $m$ is fixed, the number of edges where
neither endpoint is in $S$ is simply $m$ minus the total degree of the vertices in $S$.
Thus we can also write

$$w(S) = \mu^{m - \sum_{v \in S} \deg(v)} . \tag{6}$$

We will apply the second moment method to the total weight of all independent
sets of size $\alpha n$,

$$X = \sum_{\substack{S \subseteq V, \, |S| = \alpha n \\ S \text{ independent}}} w(S) .$$

If we tune $\mu$ properly, then for particular $\alpha^*, c^*$ we have $E[X^2] = \Theta(E[X]^2)$,
in which case $\Pr[X > 0]$ is bounded above zero. In that case $c_{\mathrm{crit}}(\alpha^*) \ge c^*$, or
equivalently $\alpha_{\mathrm{crit}}(c^*) \ge \alpha^*$.

Why is this the right type of weight? Intuitively, one of the main sources of
correlations between independent sets is the temptation to occupy low-degree
vertices. For instance, any two maximal independent sets contain all the
degree-zero vertices, giving them a large overlap. If $X$ simply counts the independent
sets of size $\alpha n$, the resulting correlations make $X$'s variance exponentially large
compared to the square of its expectation, and the second moment fails.

Weighting each $S$ as in (6) counteracts this temptation, punishing sets that
occupy low-degree vertices by reducing their weight exponentially. As we will see
below, when $\mu$ is tuned to a particular value, making this punishment condign,
these correlations disappear in the sense that the dominant contribution to $E[X^2]$
comes from pairs of sets $S, T$ of size $\alpha n$ such that $|S \cap T| = \alpha^2 n + O(\sqrt{n})$, just
as if $S$ and $T$ were chosen independently from among all sets of size $\alpha n$.

This is analogous to the situation for $k$-SAT, where satisfying assignments
are correlated because of the temptation to give each variable the truth value
that agrees with the majority of its literals in the formula. By giving each
satisfying assignment a weight $\eta^{\#\text{ of true literals}}$ and tuning $\eta$ properly, we make the
dominant contribution to $E[X^2]$ come from pairs of satisfying assignments which
agree on $n/2 + O(\sqrt{n})$ variables, just as if they were chosen independently [2].

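Proceeding, let us compute the first and second moments of our random
variable $X$. We extend the weight function $w(S)$ to all sets $S \subseteq V$ by setting
$w(S) = 0$ if $S$ is not independent. That is,

$$X = \sum_{\substack{S \subseteq V \\ |S| = \alpha n}} w(S) ,$$

where

$$w(S) = \prod_{(u,v) \in E} w_{u,v}(S)$$

and

$$w_{u,v}(S) = \begin{cases} \mu & \text{if } u, v \notin S \\ 1 & \text{if } u \in S, v \notin S \text{ or vice versa} \\ 0 & \text{if } u, v \in S . \end{cases} \tag{7}$$

We start by computing $E[X]$. Fix a set $S$ of size $\alpha n$. Since the $m$ edges are
chosen independently,

$$E[w(S)] = w_1(\alpha, \mu)^m \quad \text{where} \quad w_1(\alpha, \mu) = E_{u,v}[w_{u,v}(S)] .$$

For each edge $(u, v)$ in $\tilde{G}(n, m)$, $u$ and $v$ are chosen randomly and independently,
so the probabilities of the three cases in (7) are $(1 - \alpha)^2$, $2\alpha(1 - \alpha)$, and $\alpha^2$
respectively. Thus

$$w_1(\alpha, \mu) = (1 - \alpha)^2 \mu + 2\alpha(1 - \alpha) .$$

By linearity of expectation,

$$E[X] = \sum_{\substack{S \subseteq V \\ |S| = \alpha n}} E[w(S)] = \binom{n}{\alpha n} w_1(\alpha, \mu)^m .$$

Using Stirling's approximation (1) and substituting $m = cn/2$ gives

$$E[X] = \Theta\!\left( \frac{1}{\sqrt{n}} \, e^{n f_1(\alpha)} \right) \quad \text{where} \quad f_1(\alpha) = h(\alpha) + \frac{c}{2} \ln w_1(\alpha, \mu) . \tag{8}$$

As before, $\Theta$ hides constant factors that depend smoothly on $\alpha$.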
Next we compute the second moment. We have

$$E[X^2] = E\!\left[ \sum_S w(S) \sum_T w(T) \right] = \sum_{S,T} E[w(S) \, w(T)] ,$$

where $S$ and $T$ are subsets of $V$ of size $\alpha n$. The expectation of $w(S) \, w(T)$ does
not depend on the specific choice of $S$ and $T$, but it does depend on the size of
their intersection. We say that $S$ and $T$ have overlap $\zeta$ if $|S \cap T| = \zeta n$. Again
using the independence of the edges, we have

$$E[w(S) \, w(T)] = w_2(\alpha, \zeta, \mu)^m \quad \text{where} \quad w_2(\alpha, \zeta, \mu) = E_{u,v}[w_{u,v}(S) \, w_{u,v}(T)] .$$

For each edge $(u, v)$ of $G$, the probability that it has no endpoints in $S$ or $T$ is
$(1 - 2\alpha + \zeta)^2$, in which case it contributes $\mu^2$ to $w_{u,v}(S) \, w_{u,v}(T)$. The probability
that it has one endpoint in $S$ and none in $T$ or vice versa is $2(2\alpha - 2\zeta)(1 - 2\alpha + \zeta)$,
in which case it contributes $\mu$. Finally, the probability that it has one endpoint
in $S$ and one in $T$ is $2(\alpha - \zeta)^2 + 2\zeta(1 - 2\alpha + \zeta)$, in which case it contributes
1. With the remaining probability it has both endpoints in $S$ or both in $T$, making
at least one of them non-independent and contributing zero. Thus

$$w_2(\alpha, \zeta, \mu) = (1 - 2\alpha + \zeta)^2 \mu^2 + 4(\alpha - \zeta)(1 - 2\alpha + \zeta) \mu + 2(\alpha - \zeta)^2 + 2\zeta(1 - 2\alpha + \zeta) .$$

Observe that when $\zeta = \alpha^2$, as it typically would be if $S$ and $T$ were chosen
independently and uniformly, we have

$$w_2 = w_1^2 . \tag{9}$$

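The number of pairs of sets $S, T$ of size $\alpha n$ and intersection of size $z = \zeta n$ is
the multinomial

$$\binom{n}{\zeta n, \, (\alpha - \zeta) n, \, (\alpha - \zeta) n, \, (1 - 2\alpha + \zeta) n} = \binom{n}{\alpha n} \binom{\alpha n}{\zeta n} \binom{(1 - \alpha) n}{(\alpha - \zeta) n} ,$$

and linearity of expectation gives

$$E[X^2] = \sum_{z=0}^{\alpha n} \binom{n}{z, \, \alpha n - z, \, \alpha n - z, \, (1 - 2\alpha) n + z} w_2(\alpha, z/n, \mu)^m .$$

This sum is dominated by the terms where $\zeta = z/n$ is bounded inside the interval
$(0, \alpha)$. Stirling's approximation then gives

$$\binom{n}{\zeta n, \, (\alpha - \zeta) n, \, (\alpha - \zeta) n, \, (1 - 2\alpha + \zeta) n} = \Theta\!\left( \frac{1}{n^{3/2}} \, e^{n \left[ h(\alpha) + \alpha h(\zeta/\alpha) + (1 - \alpha) h\left(\frac{\alpha - \zeta}{1 - \alpha}\right) \right]} \right) ,$$

where $\Theta$ hides constants that vary slowly with $\alpha$ and $\zeta$. Thus the contribution
to $E[X^2]$ of pairs of sets with overlap $\zeta \in (0, \alpha)$ is

$$\Theta\!\left( \frac{1}{n^{3/2}} \, e^{n f_2(\alpha, \zeta, \mu)} \right) \tag{10}$$

where

$$f_2(\alpha, \zeta, \mu) = h(\alpha) + \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) + \frac{c}{2} \ln w_2(\alpha, \zeta, \mu) .$$

Combining (10) with (8), we can write

$$\frac{E[X^2]}{E[X]^2} = \Theta\!\left( \frac{1}{\sqrt{n}} \sum_{z=0}^{\alpha n} e^{n \phi(z/n)} \right) , \tag{11}$$

where

$$\phi(\zeta) = f_2(\alpha, \zeta, \mu) - 2 f_1(\alpha, \mu) = \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) - h(\alpha) + \frac{c}{2} \ln \frac{w_2(\alpha, \zeta, \mu)}{w_1(\alpha, \mu)^2} .$$

Using (9) and the fact that the entropy terms cancel, we have

$$\phi(\alpha^2) = 0 .$$

In other words, the contribution to $E[X^2]$ from pairs of sets with overlap $\alpha^2$ is
proportional to $E[X]^2$.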

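We can now replace the sum in (11) with an integral,

$$\frac{E[X^2]}{E[X]^2} = \Theta\!\left( \sqrt{n} \int_0^\alpha e^{n \phi(\zeta)} \, d\zeta \right) ,$$

and evaluate this integral using Laplace's method as in [1, Lemma 3]. Its
asymptotic behavior depends on the maximum value of $\phi$,

$$\phi_{\max} = \max_{\zeta \in [0, \alpha]} \phi(\zeta) .$$

If $\phi'' < 0$ at the corresponding $\zeta_{\max}$, then it is dominated by an interval of width
$\Theta(1/\sqrt{n})$ around $\zeta_{\max}$ and

$$\frac{E[X^2]}{E[X]^2} = \Theta\!\left( e^{n \phi_{\max}} \right) .$$

If $\phi_{\max} = \phi(\alpha^2) = 0$, then $E[X^2] = \Theta(E[X]^2)$ and the second moment method
succeeds. Thus our goal is to show that $\phi$ is maximized at $\alpha^2$.

For this to happen, we at least need $\zeta = \alpha^2$ to be a local maximum of $\phi$. In
particular, we need

$$\phi'(\alpha^2) = 0 . \tag{12}$$

Differentiating, we find that (12) holds if

$$\mu = \frac{1 - 2\alpha}{1 - \alpha} .$$

Henceforth, we will fix $\mu$ to this value. In that case we have

$$w_1 = 1 - \alpha \quad \text{and} \quad w_2 = (1 - \alpha)^2 + \frac{(\zeta - \alpha^2)^2}{(1 - \alpha)^2} ,$$

so

$$\phi(\zeta) = \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) - h(\alpha) + \frac{c}{2} \ln\!\left(1 + \frac{(\zeta - \alpha^2)^2}{(1 - \alpha)^4}\right) .$$

The remainder of this paper is dedicated to showing that for sufficiently small
$\alpha$ as a function of $c$ or vice versa, $\phi$ is indeed maximized at $\alpha^2$.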



3 Finding and bounding the maxima 



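Using $\ln(1 + x) \le x$, we write $\phi(\zeta) \le \psi(\zeta)$ where

$$\psi(\zeta) = \alpha h(\zeta/\alpha) + (1 - \alpha) h\!\left(\frac{\alpha - \zeta}{1 - \alpha}\right) - h(\alpha) + \frac{c}{2} \, \frac{(\zeta - \alpha^2)^2}{(1 - \alpha)^4} . \tag{13}$$

Note that

$$\psi(\alpha^2) = \phi(\alpha^2) = 0 .$$

Our goal is to show for an appropriate $c$ that $\zeta = \alpha^2$ is in fact the global
maximum of $\psi$, and therefore of $\phi$. In what follows, asymptotic symbols such as
$O$ and $o$ refer to the limit $\alpha \to 0$, or equivalently the limit $c \to \infty$. Error terms
may be positive or negative unless otherwise stated.

The first two derivatives of $\psi(\zeta)$ are

$$\psi'(\zeta) = \frac{c (\zeta - \alpha^2)}{(1 - \alpha)^4} + 2 \ln(\alpha - \zeta) - \ln \zeta - \ln(1 - 2\alpha + \zeta) \tag{14}$$

$$\psi''(\zeta) = \frac{c}{(1 - \alpha)^4} - \frac{2}{\alpha - \zeta} - \frac{1}{\zeta} - \frac{1}{1 - 2\alpha + \zeta} . \tag{15}$$

The second derivative $\psi''(\zeta)$ tends to $-\infty$ at $\zeta = 0$ and $\zeta = \alpha$. Setting $\psi''(\zeta) = 0$
yields a cubic equation in $\zeta$ which has one negative root and, for sufficiently small
$\alpha$, two positive roots in the interval $[0, \alpha]$. Thus for the relevant $c$ and sufficiently small
$\alpha$, there are $0 < \zeta_1 < \zeta_2 < \alpha$ where

$$\psi''(\zeta) \begin{cases} < 0 & 0 < \zeta < \zeta_1 \\ > 0 & \zeta_1 < \zeta < \zeta_2 \\ < 0 & \zeta_2 < \zeta < \alpha . \end{cases}$$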

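It follows that $\psi$ can have at most two local maxima. One is in the interval
$[0, \zeta_1]$, and the following lemma shows that for the relevant $\alpha$ and $c$ this is $\alpha^2$:

Lemma 3.1. If $c = o(1/\alpha^2)$ then for sufficiently small $\alpha$, $\zeta_1 > \alpha^2$ and $\psi(\alpha^2)$ is
a local maximum.

The other local maximum is in the interval $[\zeta_2, \alpha]$, and we denote it $\zeta_3$. To locate
it, first we bound $\zeta_2$:

Lemma 3.2. If

$$c = (2 + o(1)) \, \frac{\ln(1/\alpha)}{\alpha} ,$$

then

$$\frac{\zeta_2}{\alpha} = 1 - \delta_2 \quad \text{where} \quad \delta_2 = \frac{1 + o(1)}{\ln(1/\alpha)} .$$

Thus $\zeta_2/\alpha$, and therefore $\zeta_3/\alpha$, tends toward 1 as $\alpha \to 0$.
We can now locate $\zeta_3$ when $\alpha$ is close to its critical value.

Lemma 3.3. If

$$c = \frac{1}{\alpha} \left( 2 \ln(1/\alpha) + 2 - o(1) \right) ,$$

then

$$\frac{\zeta_3}{\alpha} = 1 - \delta_3 \quad \text{where} \quad \delta_3 = \frac{1 + o(1)}{e} \sqrt{\alpha} .$$

Lemma 3.4. For any constant $x > 4/e$, if

$$c = \frac{2 \ln(1/\alpha) + 2 - x \sqrt{\alpha}}{\alpha} ,$$

then $\psi(\zeta_3) < 0$ for sufficiently small $\alpha$.

4 Proofs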

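Proof of Lemma 3.1. Setting $\zeta = \alpha^2$ in (15) gives

$$\psi''(\alpha^2) = \frac{c}{(1 - \alpha)^4} - \frac{1}{\alpha^2 (1 - \alpha)^2} .$$

If $c = o(1/\alpha^2)$ this is negative for sufficiently small $\alpha$, in which case $\zeta_1 > \alpha^2$ and
$\psi(\alpha^2)$ is a local maximum. □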

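Proof of Lemma 3.2. For any constant $b$, if

$$\frac{\zeta}{\alpha} = 1 - \delta \quad \text{where} \quad \delta = \frac{b}{\ln(1/\alpha)} \tag{16}$$

then (15) gives

$$\psi''(\zeta) = \left( 2 - \frac{2}{b} + o(1) \right) \frac{\ln(1/\alpha)}{\alpha} - O(1/\alpha) .$$

If $b \ne 1$, for sufficiently small $\alpha$ this is negative if $b < 1$ and positive if $b > 1$.
Therefore $\zeta_2/\alpha = 1 - \delta_2$ where $\delta_2 = (1 + o(1))/\ln(1/\alpha)$. □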

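Proof of Lemma 3.3. Lemma 3.2 tells us that $\zeta_3 = \alpha(1 - \delta)$ for some

$$\delta \le \frac{1 + o(1)}{\ln(1/\alpha)} .$$

Setting $\zeta = \alpha(1 - \delta)$ in (14) and using

$$\frac{1}{(1 - \alpha)^4} = 1 + O(\alpha) \quad \text{and} \quad -\ln(1 - x) = O(x)$$

gives, after some algebra,

$$\psi'(\zeta) = \alpha c + \ln \alpha + 2 \ln \delta + O(\alpha \delta c) + O(\alpha^2 c) .$$

For any constant $b$, setting

$$\delta = \frac{b \sqrt{\alpha}}{e}$$

gives

$$\psi'(\zeta) = \alpha c + 2 \ln \alpha + 2 \ln b - 2 + O(\alpha^{3/2} c) ,$$

and setting

$$c = \frac{2 \ln(1/\alpha) + 2 - \varepsilon}{\alpha}$$

then gives

$$\psi'(\zeta) = 2 \ln b - \varepsilon + o(1) .$$

If $\varepsilon = o(1)$ and $b \ne 1$, for sufficiently small $\alpha$ this is negative if $b < 1$ and positive
if $b > 1$. Therefore $\zeta_3/\alpha = 1 - \delta_3$ where $\delta_3 = (1 + o(1)) \sqrt{\alpha}/e$. □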

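Proof of Lemma 3.4. Setting $\zeta = \alpha(1 - \delta)$ where $\delta = b \sqrt{\alpha}/e$ in (13) and using
the Taylor series

$$\frac{1}{(1 - \alpha)^4} = 1 + 4\alpha + O(\alpha^2) \quad \text{and} \quad -\ln(1 - x) = x + x^2/2 + O(x^3)$$

gives, after a fair amount of algebra,

$$\psi(\zeta) = \alpha (\ln \alpha - 1) - \left( \frac{2b \ln \alpha - 4b + 2b \ln b}{e} \right) \alpha^{3/2} + \left( \frac{c + 1}{2} - \frac{b^2}{2e^2} \right) \alpha^2 - \frac{b}{e} \, \alpha^{5/2} c + \left( \frac{b^2}{2e^2} + 1 \right) \alpha^3 c + O(\alpha^{7/2} c) + O(\alpha^{5/2}) .$$

Setting

$$c = \frac{2 \ln(1/\alpha) + 2 - x \sqrt{\alpha}}{\alpha}$$

for constant $x$ causes the terms proportional to $\alpha \ln \alpha$, $\alpha$, and $\alpha^{3/2} \ln \alpha$ to cancel,
leaving

$$\psi(\zeta) = \left( \frac{2b (1 - \ln b)}{e} - \frac{x}{2} \right) \alpha^{3/2} + O(\alpha^2 \ln(1/\alpha)) .$$

The coefficient of $\alpha^{3/2}$ is maximized when $b = 1$, and is negative whenever
$x > 4/e$. In that case, $\psi(\zeta_3) < 0$ for sufficiently small $\alpha$, completing the proof.
□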
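The expansion used in this proof can be compared against a direct evaluation of $\psi$. The Python sketch below computes both at overlap $\alpha(1 - b\sqrt{\alpha}/e)$ and reports their difference, which should be of the size of the dropped error terms. The function names and the sample values (density $10^{-4}$, $b = 1$, and the degree obtained by setting $x = 0$) are assumptions made for illustration only.

```python
import math

def h(x):
    """Entropy function in nats, with h(0) = h(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log(x) - (1.0 - x) * math.log1p(-x)

def psi(zeta, alpha, c):
    """Direct evaluation of psi(zeta)."""
    entropy_part = (alpha * h(zeta / alpha)
                    + (1.0 - alpha) * h((alpha - zeta) / (1.0 - alpha))
                    - h(alpha))
    return entropy_part + 0.5 * c * (zeta - alpha ** 2) ** 2 / (1.0 - alpha) ** 4

def psi_expansion(alpha, c, b):
    """Truncated expansion of psi(alpha (1 - b sqrt(alpha)/e)), error terms dropped."""
    e = math.e
    la = math.log(alpha)
    return (alpha * (la - 1.0)
            - (2 * b * la - 4 * b + 2 * b * math.log(b)) / e * alpha ** 1.5
            + ((c + 1.0) / 2.0 - b * b / (2 * e * e)) * alpha ** 2
            - b / e * alpha ** 2.5 * c
            + (b * b / (2 * e * e) + 1.0) * alpha ** 3 * c)

if __name__ == "__main__":
    alpha, b = 1e-4, 1.0
    c = (2.0 * math.log(1.0 / alpha) + 2.0) / alpha
    zeta = alpha * (1.0 - b * math.sqrt(alpha) / math.e)
    exact, approx = psi(zeta, alpha, c), psi_expansion(alpha, c, b)
    print(exact, approx, exact - approx)
```

Because the leading terms $\alpha(\ln\alpha - 1)$ and $c\alpha^2/2$ nearly cancel, the comparison is a fairly sensitive check on the coefficients of the remaining terms.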

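Proof of Corollary 1.2. First note that

$$\alpha_0 = \frac{2}{c} W\!\left(\frac{ec}{2}\right) \tag{17}$$

is the root of the equation

$$c = 2 \, \frac{\ln(1/\alpha_0) + 1}{\alpha_0} ,$$

since we can also write it as

$$e^{c \alpha_0 / 2} = \frac{e}{\alpha_0} ,$$

and multiplying both sides by $c \alpha_0 / 2$ gives

$$\frac{c \alpha_0}{2} \, e^{c \alpha_0 / 2} = \frac{ec}{2} ,$$

in which case (17) follows from the definition of $W$.
The root $\alpha$ of

$$c = 2 \, \frac{\ln(1/\alpha) + 1}{\alpha} - \frac{x}{\sqrt{\alpha}}$$

is then at least

$$\frac{2}{c} W\!\left(\frac{ec}{2}\right) + (x + o(1)) \, \frac{\partial \alpha_0}{\partial c} \sqrt{\frac{c}{2 \ln c}} ,$$

since $\alpha = (1 + o(1)) \, 2 \ln c / c$ and $\partial^2 \alpha_0 / \partial c^2 \ge 0$. Since

$$\frac{\partial \alpha_0}{\partial c} = -(1 + o(1)) \, \frac{2 \ln c}{c^2} ,$$

the statement follows from Theorem 1.1. □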